Author Identification from Literary Articles with Visual Features: A Case Study with Bangla Documents

نویسندگان

چکیده

Author identification is an important aspect of literary analysis, studied in natural language processing (NLP). It aids identify the most probable author articles, news texts or social media comments and tweets, for example. can be applied to other domains such as criminal civil cases, cybersecurity, forensics, plagiarizer, many more. An automated system this context thus very beneficial society. In paper, we propose a convolutional neural network (CNN)-based from articles. This uses visual features along with five-layer authors. The prime motivation behind approach was feasibility distinct writing styles through visualization patterns. Experiments were performed on 1200 articles 50 authors achieving maximum accuracy 93.58%. Furthermore, see how different volumes data, experiments partitions dataset. outperformed standard handcrafted feature-based techniques well established works publicly available datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparative Study on the Technical and Visual Features of Na’in Carpets with the Emigrated Ones

Na’in carpet is an eclectic mixture of Kerman and Arak carpets’ style, which in a short time established an independent identity on the basis of indigenous components and became known as a distinct style from other types. However, the expansion of production beyond the city boundaries caused changes in it, so that in many cases, due to the apparent similarities, this carpet style has led to amb...

متن کامل

Automatic Identification of Research Articles from Crawled Documents

Online digital libraries that store and index research articles not only make it easier for researchers to search for scientific information, but also have been proven as powerful resources in many data mining, machine learning and information retrieval applications that require high-quality data. The quality of the data available in digital libraries highly depends on the quality of a classifi...

متن کامل

Author Identification using Stylometric Features

In this work we present a strategy for author identification for documents written in Portuguese. It takes into account a writer-independent model which reduces the pattern recognition problem to a single model and two classes, hence, makes it possible to build robust system even when few genuine samples per writer are available. We also introduce a stylometric feature set, which is based on th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Future Internet

سال: 2022

ISSN: ['1999-5903']

DOI: https://doi.org/10.3390/fi14100272